PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.004G037800.1.p
Common NameSb04g003240, SORBIDRAFT_04g003240
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family ARF
Protein Properties Length: 1071aa    MW: 119870 Da    PI: 6.4086
Description ARF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.004G037800.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B367.81.5e-2171172199
                           EEEE-..-HHHHTT-EE--HHH.HTT.......---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEE CS
                    B3   1 ffkvltpsdvlksgrlvlpkkfaeeh.......ggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvF 82 
                           f+k+lt sd++++g +++p++ ae++       + +++  ++l+ +d + ++W++++i+r++++r++lt+GW+ Fv  ++L +gD+v+F
  Sobic.004G037800.1.p  71 FCKTLTASDTSTHGGFSVPRRAAEKIlppldfsM-QPPA-QELQARDIHDNVWTFRHIFRGQPKRHLLTTGWSLFVGGKRLFAGDSVIF 157
                           99*****************************954.4444.49*********************************************** PP

                           EE-SSSEE..EEEEE-S CS
                    B3  83 kldgrsefelvvkvfrk 99 
                              ++++++l+++++r+
  Sobic.004G037800.1.p 158 V--RDERQQLLLGIRRA 172
                           *..4577888****997 PP

2Auxin_resp117.51.1e-38197280183
            Auxin_resp   1 aahaastksvFevvYnPrastseFvvkvekvekalk.vkvsvGmRfkmafetedsserrlsGtvvgvsdldpvrWpnSkWrsLk 83 
                           aahaa+++s+F+++YnPras++eFv++++k++kal+ +++s+GmRf+m+fete+   rr++Gt++g+sdldpvrW+nS+Wr+L+
  Sobic.004G037800.1.p 197 AAHAAANNSPFTIFYNPRASPTEFVIPFAKYQKALYsNQISLGMRFRMMFETEELGMRRYMGTITGISDLDPVRWKNSQWRNLQ 280
                           79*********************************989********************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019362.35E-4060200IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.102.2E-3964186IPR015300DNA-binding pseudobarrel domain
CDDcd100173.71E-1870162No hitNo description
SMARTSM010199.9E-2171173IPR003340B3 DNA binding domain
PfamPF023622.9E-1971172IPR003340B3 DNA binding domain
PROSITE profilePS5086311.47671173IPR003340B3 DNA binding domain
PfamPF065071.5E-33197280IPR010525Auxin response factor
PfamPF023091.2E-89351022IPR033389AUX/IAA domain
PROSITE profilePS5174528.739371021IPR000270PB1 domain
SuperFamilySSF542772.62E-79501016No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009734Biological Processauxin-activated signaling pathway
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0005515Molecular Functionprotein binding
Sequence ? help Back to Top
Protein Sequence    Length: 1071 aa     Download sequence    Send to blast
MQKDIDAHVP SYPNLPSKLI CLLHSVTLHA DPDTDEVYAQ MTLQPVNTYG KEALQLSELA  60
LKHARPQMEF FCKTLTASDT STHGGFSVPR RAAEKILPPL DFSMQPPAQE LQARDIHDNV  120
WTFRHIFRGQ PKRHLLTTGW SLFVGGKRLF AGDSVIFVRD ERQQLLLGIR RASRQPTNIS  180
SSVLSSDSMH IGVLAAAAHA AANNSPFTIF YNPRASPTEF VIPFAKYQKA LYSNQISLGM  240
RFRMMFETEE LGMRRYMGTI TGISDLDPVR WKNSQWRNLQ VGWDESAAGE RRNRVSMWEI  300
EPIAAPFFIC PQPFFGVKRP RQIDDESSEM ENLFKRAMPW LGEEICIKDA QTQNTTMPGL  360
SLVQWMNMNR QQSSTLANTG IQSEYLRSLS NPAMQNLGAA ELARQLYVQN HLLQQNSVQL  420
NASKLPQQMQ PINELAKGSL SCNQLDTITN HQELKQEVGN QQRQQQHINQ TIPLSQAQAN  480
LVQAQVIIQT QMQQQQQQQQ QQPSPTRCQQ GTSEQQLLLS QQHQDQNFQL QQQQQLLLQE  540
LQRQQQQNQQ QLNKLPGQLV NLAGQQAQLS DQELQLQLLQ KLQQQSLISQ PAVTLSRLPL  600
MQEQQKLLLD MQQLSSSHSL AQQRIMPQQD SKVSLQASQA PPTMKQEQQK LSQKQVALAN  660
VSDVAFQQIS STNVLSKAGS QLMIPGATQS VLTEEIPSCS TSPSTANNGN HLAHPTIGRN  720
EHCKVNMEKV PQSSALMSIP TSSEAVTTPI MMKESSKLNH NLKENVITSK SPTVGTGHDN  780
LLNIVPSTEN LETASSATSL WPTQTDGLLH QGFPTSNLNQ QQMFKDALAD VEIQEVDPTN  840
NAFFGINNDG PLSFPMETEG LLVSALNPVK CQTNLSTDVE NNYRIQKDAQ QEISTSMVSQ  900
SFGQSDIAFN SIDSAINDGA MLNRNSWPPA PPPQRMRTFT KVYKRGAVGR SIDIGRFSGY  960
EELKHALARM FGIEGQLEDR QRIGWKLVYK DHEDDILLLG DDPWEEFVNC VKCIRILSPQ  1020
EVQQMSLDGD LGNNVLSNQA CSSSDGGNAW KPRCDQNPGN PSIGFYDQFE *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4ldu_A1e-141930196388Auxin response factor 5
Search in ModeBase
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Sbi.210390.0callus| panicle
Functional Description ? help Back to Top
Source Description
UniProtAuxin response factors (ARFs) are transcriptional factors that binds specifically to the DNA sequence 5'-TGTCTC-3' found in the auxin-responsive promoter elements (AuxREs). {ECO:0000256|RuleBase:RU004561}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHM0045350.0HM004535.1 Zea mays auxin response factor 20 (ARF20) gene, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002453284.10.0hypothetical protein SORBIDRAFT_04g003240
SwissprotQ6Z2W30.0ARFE_ORYSJ; Auxin response factor 5
TrEMBLC5XUJ90.0C5XUJ9_SORBI; Auxin response factor
STRINGSb04g003240.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP140935106
Representative plantOGRP41931125
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G19220.10.0auxin response factor 19